Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
7468 | 38 | 1,000 |
11988 | 19 | 1,200 |
13397 | 16 | 1,500 |
14068 | 15 | 50,000 |
14709 | 14 | 1,050 |
14712 | 14 | 100,000 |
15548 | 13 | 30,000 |
16359 | 12 | 15,000 |
16391 | 12 | 2,000 |
17341 | 11 | 1,100 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
12946 | 17 | f(x |
21715 | 8 | آپ(ص |
26366 | 6 | f′(x |
34429 | 4 | O(n |
40283 | 4 | یوگریٹک(یوگارت |
41400 | 3 | A(D |
41808 | 3 | E(G |
41841 | 3 | F(x |
42929 | 3 | V(D |
42930 | 3 | V(G |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
4916 | 70 | ہے)۔ |
12869 | 18 | ہیں)۔ |
13625 | 16 | جون)، |
13653 | 16 | دسمبر)، |
13689 | 16 | سال)، |
15488 | 14 | ہے)، |
20358 | 9 | تھا)۔ |
20995 | 9 | نہیں)۔ |
22840 | 8 | میں)، |
23745 | 7 | hatched).svg |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
29343 | 5 | 80% |
29380 | 5 | 95% |
33378 | 4 | %20 |
33682 | 4 | 20% |
33744 | 4 | 5% |
33789 | 4 | 60% |
40387 | 3 | %40 |
40388 | 3 | %50 |
40389 | 3 | %55 |
40390 | 3 | %7 |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
41408 | 3 | AT&T |
99170 | 1 | Computer-Aided-Designing-&-Drafting |
117950 | 1 | aew&c |
192465 | 1 | ۴۰S&W |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
51155 | 2 | $50 |
82423 | 1 | $200,000 |
82424 | 1 | $60 |
82425 | 1 | $AU |
82426 | 1 | $، |
96054 | 1 | A$ |
98349 | 1 | C$ |
174775 | 1 | ٤٥٠$ |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
4623 | 75 | People's |
15589 | 13 | Côte-d'Or |
16474 | 12 | L'Islet |
18591 | 10 | D'Autray |
23540 | 7 | Côtes-d'Armor |
23657 | 7 | Saint-Clair-sur-l'Elle |
28989 | 6 | ہے'۔ |
29588 | 5 | L'Île-d'Orléans |
29659 | 5 | Pays-d'en-Haut |
29673 | 5 | Pont-l'Évêque |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
9932 | 26 | ٹوئنٹی/20 |
17532 | 11 | http://ecp |
21665 | 8 | http://en |
22934 | 8 | ١٧/١٤ |
23377 | 7 | 17/14 |
23566 | 7 | I/V |
23747 | 7 | http://hamzaurduarchive |
26307 | 6 | TCP/IP |
29374 | 5 | 9/11 |
29920 | 5 | https://www |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
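The same query can be repeated for each character in the list. The following is a minimal sketch, not the tooling used to produce this report: it assumes a `words` table with columns `w_id`, `word`, and `freq`, as the query above suggests, and it uses an in-memory SQLite database purely for illustration. The helper names `like_pattern`, `words_containing`, and `demo_connection` are hypothetical. Because `%` and `_` are wildcards in `LIKE`, they are escaped with a backslash; SQLite needs an explicit `ESCAPE` clause for this. The `ORDER BY w_id` is also an assumption, added so that the result order does not depend on the storage engine.

```python
import sqlite3

# Characters checked in this subsection (taken from the list above).
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch: str) -> str:
    """Build a LIKE pattern that matches words containing `ch` literally."""
    # % and _ are LIKE wildcards; the backslash is our escape character.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing `ch`."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def demo_connection():
    """Create an illustrative in-memory table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    # A few rows from the tables in this section; w_id = rank + 100.
    rows = [
        (7568, "1,000", 38),
        (29443, "80%", 5),
        (33478, "%20", 4),
        (41508, "AT&T", 3),
        (51255, "$50", 2),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    for ch in SPECIAL_CHARS:
        for rank, freq, word in words_containing(conn, ch):
            print(ch, rank, freq, word)
```

Passing the pattern as a bound parameter avoids quoting problems for `"` and `'`, which would otherwise have to be escaped inside the SQL string literal.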
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots